Digital Library

28 results for Chloroplast genome

in the Biblioteca Digital da Produção Intelectual da Universidade de São Paulo

Whole genome sequencing analysis of Plasmodium vivax using whole genome capture

Relevance: 20.00%

Abstract:

Background: Malaria caused by Plasmodium vivax is an experimentally neglected severe disease with a substantial burden on human health. Because of technical limitations, little is known about the biology of this important human pathogen. Whole genome analysis methods on patient-derived material are thus likely to have a substantial impact on our understanding of P. vivax pathogenesis and epidemiology. For example, it will allow study of the evolution and population biology of the parasite, allow parasite transmission patterns to be characterized, and may facilitate the identification of new drug resistance genes. Because parasitemias are typically low and the parasite cannot be readily cultured, on-site leukocyte depletion of blood samples is typically needed to remove human DNA that may be 1000X more abundant than parasite DNA. These features have precluded the analysis of archived blood samples and require the presence of laboratories in close proximity to the collection of field samples for optimal pre-cryopreservation sample preparation. Results: Here we show that in-solution hybridization capture can be used to extract P. vivax DNA from human contaminating DNA in the laboratory without the need for on-site leukocyte filtration. Using a whole genome capture method, we were able to enrich P. vivax DNA from bulk genomic DNA from less than 0.5% to a median of 55% (range 20%-80%). This level of enrichment allows for efficient analysis of the samples by whole genome sequencing and does not introduce any gross biases into the data. With this method, we obtained greater than 5X coverage across 93% of the P. vivax genome for four P. vivax strains from Iquitos, Peru, which is similar to our results using leukocyte filtration (greater than 5X coverage across 96% of the genome). Conclusion: The whole genome capture technique will enable more efficient whole genome analysis of P. vivax from a larger geographic region and from valuable archived sample collections.

Whole-genome sequencing of the efficient industrial fuel-ethanol fermentative Saccharomyces cerevisiae strain CAT-1

Relevance: 20.00%

Abstract:

The Saccharomyces cerevisiae strains widely used for industrial fuel-ethanol production have been developed by selection, but their underlying beneficial genetic polymorphisms remain unknown. Here, we report the draft whole-genome sequence of the S. cerevisiae strain CAT-1, which is a dominant fuel-ethanol fermentative strain from the sugarcane industry in Brazil. Our results indicate that strain CAT-1 is a highly heterozygous diploid yeast strain, and the ~12-Mb genome of CAT-1, when compared with the reference S288c genome, contains ~36,000 homozygous and ~30,000 heterozygous single nucleotide polymorphisms, exhibiting an uneven distribution among chromosomes due to large genomic regions of loss of heterozygosity (LOH). In total, 58% of the 6,652 predicted protein-coding genes of the CAT-1 genome constitute different alleles when compared with the genes present in the reference S288c genome. The CAT-1 genome contains a reduced number of transposable elements, as well as several gene deletions and duplications, especially at telomeric regions, some correlated with several of the physiological characteristics of this industrial fuel-ethanol strain. Phylogenetic analyses revealed that some genes were likely associated with traits important for bioethanol production. Identifying and characterizing the allelic variations controlling traits relevant to industrial fermentation should provide the basis for a forward genetics approach for developing better fermenting yeast strains.

Diversification of hAT transposase paralogues in the sugarcane genome

Relevance: 20.00%

Abstract:

Transposons are abundant components of eukaryotic genomes and play an important role in genome evolution. Knowledge about these elements should contribute to the understanding of their impact on host genomes. The hAT transposon superfamily is one of the best characterized superfamilies in diverse organisms; nevertheless, a detailed study of these elements has never been carried out in sugarcane. To address this question, we analyzed 32 cDNAs similar to those of the hAT superfamily of transposons previously identified in the sugarcane transcriptome. Our results revealed that these hAT-like transposases cluster in one highly homogeneous lineage and other, more heterogeneous lineages. We present evidence supporting the hypothesis that the highly homogeneous group is a domesticated transposase, while the remaining lineages are composed of transposon units. The first is common to grasses, clusters significantly with domesticated transposases from Arabidopsis, rice and sorghum, and is expressed in different tissues of the two sugarcane cultivars analyzed. In contrast, the more heterogeneous group represents at least two transposon lineages. We recovered five genomic versions of one lineage, characterizing a novel transposon family with a conserved DDE motif, named SChAT. These results indicate the presence of at least three distinct lineages of hAT-like transposase paralogues in the sugarcane genome, including a novel transposon family described in Saccharum and a domesticated transposase. Taken together, these findings make it possible to follow the diversification of some hAT transposase paralogues in sugarcane, adding to knowledge about the co-evolution of transposons and their host genomes.

Assignment of Putative Functions to Membrane "Hypothetical Proteins" from the Trypanosoma cruzi Genome

Relevance: 20.00%

Abstract:

Protozoan parasites cause thousands of deaths each year in developing countries. The genome projects of these parasites opened a new era in the identification of therapeutic targets. However, the putative function could be predicted for fewer than half of the protein-coding genes. In this work, all Trypanosoma cruzi proteins containing predicted transmembrane spans were processed through an automated computational routine and further analyzed in order to assign the most probable function. The analysis consisted of dissecting the whole predicted protein into different regions. More than 5,000 sequences were processed, and the predicted biological functions were grouped into 19 categories according to the hits obtained after analysis. One focus of interest, due to the scarce information available on trypanosomatids, is the proteins involved in signal-transduction processes. In the present work, we identified 54 proteins belonging to this group, which were individually analyzed. The results show that by means of a simple pipeline it was possible to attribute probable functions to sequences annotated as coding for "hypothetical proteins." Also, we successfully identified the majority of candidates participating in the signal-transduction pathways in T. cruzi.

Genome Sequence of the Bacterioplanktonic, Mixotrophic Vibrio campbellii Strain PEL22A, Isolated in the Abrolhos Bank

Relevance: 20.00%

Abstract:

Vibrio campbellii PEL22A was isolated from open ocean water in the Abrolhos Bank. The genome of PEL22A consists of 6,788,038 bp (the GC content is 45%). The number of coding sequences (CDS) is 6,359, as determined according to the Rapid Annotation using Subsystem Technology (RAST) server. The number of ribosomal genes is 80, of which 68 are tRNAs and 12 are rRNAs. V. campbellii PEL22A contains genes related to virulence and fitness, including a complete proteorhodopsin cluster, complete type II and III secretion systems, incomplete type I, IV, and VI secretion systems, a hemolysin, and CTXΦ.

Structural and Functional Insights from the Metagenome of an Acidic Hot Spring Microbial Planktonic Community in the Colombian Andes

Relevance: 20.00%

Abstract:

A taxonomic and annotated functional description of microbial life was deduced from 53 Mb of metagenomic sequence retrieved from a planktonic fraction of the Neotropical high Andean (3,973 meters above sea level) acidic hot spring El Coquito (EC). A classification of unassembled metagenomic reads using different databases showed a high proportion of Gammaproteobacteria and Alphaproteobacteria (in total read affiliation), and through taxonomic affiliation of 16S rRNA gene fragments we observed the presence of Proteobacteria, micro-algae chloroplast and Firmicutes. Reads mapped against the genomes Acidiphilium cryptum JF-5, Legionella pneumophila str. Corby and Acidithiobacillus caldus revealed the presence of transposase-like sequences, potentially involved in horizontal gene transfer. Functional annotation and hierarchical comparison with different datasets obtained by pyrosequencing in different ecosystems showed that the microbial community also contained extensive DNA repair systems, possibly to cope with ultraviolet radiation at such high altitudes. Analysis of genes involved in the nitrogen cycle indicated the presence of dissimilatory nitrate reduction to N2 (narGHI, nirS, norBCDQ and nosZ), associated with Proteobacteria-like sequences. Genes involved in the sulfur cycle (cysDN, cysNC and aprA) indicated adenylsulfate and sulfite production that were affiliated to several bacterial species. In summary, metagenomic sequence data provided insight regarding the structure and possible functions of this hot spring microbial community, describing some groups potentially involved in the nitrogen and sulfur cycling in this environment. Citation: Jimenez DJ, Andreote FD, Chaves D, Montana JS, Osorio-Forero C, et al. (2012) Structural and Functional Insights from the Metagenome of an Acidic Hot Spring Microbial Planktonic Community in the Colombian Andes. PLoS ONE 7(12): e52069. doi:10.1371/journal.pone.0052069

Genome Sequence of Exiguobacterium antarcticum B7, Isolated from a Biofilm in Ginger Lake, King George Island, Antarctica

Relevance: 20.00%

Abstract:

Exiguobacterium antarcticum is a psychrotrophic bacterium isolated for the first time from microbial mats of Lake Fryxell in Antarctica. Many organisms of the genus Exiguobacterium are extremophiles and have properties of biotechnological interest, e.g., the capacity to adapt to cold, which makes this genus a target for discovering new enzymes, such as lipases and proteases, in addition to improving our understanding of the mechanisms of adaptation and survival at low temperatures. This study presents the genome of E. antarcticum B7, isolated from a biofilm sample of Ginger Lake on King George Island, Antarctic Peninsula.

DNA repair mechanisms protect our genome from carcinogenesis

Relevance: 20.00%

Abstract:

Human cells are constantly exposed to DNA damage. Without repair, damage can result in genetic instability and eventually cancer. The strong association between the lack of DNA damage repair, mutations and cancer is dramatically demonstrated by a number of cancer-prone human syndromes, such as xeroderma pigmentosum (XP), ataxia-telangiectasia (AT) and Fanconi anemia (FA). This review focuses on the historical discoveries related to these three diseases and describes their impact on the understanding of DNA repair mechanisms and the causes of human cancer. As deficiencies in DNA repair are also often associated with progeria symptoms, unrepaired damage and aging appear to be linked. Several other pathologies associated with DNA repair defects, genetic instability and increased cancer risk are also discussed. In fact, studies with cells from these many syndromes have helped in understanding important levels of protection against cancer and aging, although they have so far offered patients little in terms of therapy. Finally, the recent advances in combined basic and translational research on DNA repair and chemotherapy are presented.

Proteomic Analysis of Chloroplast-to-Chromoplast Transition in Tomato Reveals Metabolic Shifts Coupled with Disrupted Thylakoid Biogenesis Machinery and Elevated Energy-Production Components

Relevance: 20.00%

Abstract:

A comparative proteomic approach was performed to identify differentially expressed proteins in plastids at three stages of tomato (Solanum lycopersicum) fruit ripening (mature-green, breaker, red). Stringent curation and processing of the data from three independent replicates identified 1,932 proteins among which 1,529 were quantified by spectral counting. The quantification procedures have been subsequently validated by immunoblot analysis of six proteins representative of distinct metabolic or regulatory pathways. Among the main features of the chloroplast-to-chromoplast transition revealed by the study, chromoplastogenesis appears to be associated with major metabolic shifts: (1) strong decrease in abundance of proteins of light reactions (photosynthesis, Calvin cycle, photorespiration) and carbohydrate metabolism (starch synthesis/degradation), mostly between breaker and red stages and (2) increase in terpenoid biosynthesis (including carotenoids) and stress-response proteins (ascorbate-glutathione cycle, abiotic stress, redox, heat shock). These metabolic shifts are preceded by the accumulation of plastid-encoded acetyl Coenzyme A carboxylase D proteins accounting for the generation of a storage matrix that will accumulate carotenoids. Of particular note is the high abundance of proteins involved in providing energy and in metabolites import. Structural differentiation of the chromoplast is characterized by a sharp and continuous decrease of thylakoid proteins whereas envelope and stroma proteins remain remarkably stable. This is coincident with the disruption of the machinery for thylakoids and photosystem biogenesis (vesicular trafficking, provision of material for thylakoid biosynthesis, photosystems assembly) and the loss of the plastid division machinery. Altogether, the data provide new insights on the chromoplast differentiation process while enriching our knowledge of the plant plastid proteome.

SIS: a program to generate draft genome sequence scaffolds for prokaryotes

Relevance: 20.00%

Abstract:

Background: Decreasing costs of DNA sequencing have made prokaryotic draft genome sequences increasingly common. A contig scaffold is an ordering of contigs in the correct orientation. A scaffold can help genome comparisons and guide gap closure efforts. One popular technique for obtaining contig scaffolds is to map contigs onto a reference genome. However, rearrangements that may exist between the query and reference genomes may result in incorrect scaffolds, if these rearrangements are not taken into account. Large-scale inversions are common rearrangement events in prokaryotic genomes. Even in draft genomes it is possible to detect the presence of inversions given sufficient sequencing coverage and a sufficiently close reference genome. Results: We present a linear-time algorithm that can generate a set of contig scaffolds for a draft genome sequence represented in contigs given a reference genome. The algorithm is aimed at prokaryotic genomes and relies on the presence of matching sequence patterns between the query and reference genomes that can be interpreted as the result of large-scale inversions; we call these patterns inversion signatures. Our algorithm is capable of correctly generating a scaffold if at least one member of every inversion signature pair is present in contigs and no inversion signatures have been overwritten in evolution. The algorithm is also capable of generating scaffolds in the presence of any kind of inversion, even though in this general case there is no guarantee that all scaffolds in the scaffold set will be correct. We compare the performance of SIS, the program that implements the algorithm, to seven other scaffold-generating programs. The results of our tests show that SIS has overall better performance. Conclusions: SIS is a new easy-to-use tool to generate contig scaffolds, available both as stand-alone and as a web server. 
The good performance of SIS in our tests adds evidence that large-scale inversions are widespread in prokaryotic genomes.
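
The reference-mapping technique that the abstract describes as the popular baseline can be sketched as follows. This is only a minimal illustration of ordering and orienting contigs by their position on a reference; it is not the SIS algorithm, and it ignores the inversions that SIS is designed to handle. The `Hit` record and the `naive_scaffold` function are hypothetical names introduced here, and the sketch assumes each contig has exactly one mapping to the reference.

```python
from typing import List, NamedTuple, Tuple


class Hit(NamedTuple):
    """One contig-to-reference mapping (hypothetical record for this sketch)."""
    contig: str     # contig identifier
    ref_start: int  # start coordinate of the match on the reference
    strand: str     # "+" if the contig matches as-is, "-" if reverse-complemented


def naive_scaffold(hits: List[Hit]) -> List[Tuple[str, str]]:
    """Order contigs by reference position and report each one's orientation.

    Assumes one hit per contig and no rearrangements between query and
    reference, which is exactly the assumption the abstract warns about.
    """
    ordered = sorted(hits, key=lambda h: h.ref_start)
    return [(h.contig, h.strand) for h in ordered]


if __name__ == "__main__":
    example = [Hit("c2", 5000, "-"), Hit("c1", 100, "+"), Hit("c3", 9000, "+")]
    print(naive_scaffold(example))
```

If the query genome carries an inversion relative to the reference, this naive ordering places the affected contigs in the reference's order rather than the query's, which is the failure mode that motivates the inversion signatures described in the abstract.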

Whole blood genome-wide gene expression profile in males after prolonged wakefulness and sleep recovery

Relevance: 20.00%

Abstract:

Pellegrino R, Sunaga DY, Guindalini C, Martins RC, Mazzotti DR, Wei Z, Daye ZJ, Andersen ML, Tufik S. Whole blood genome-wide gene expression profile in males after prolonged wakefulness and sleep recovery. Physiol Genomics 44: 1003-1012, 2012. First published September 4, 2012; doi: 10.1152/physiolgenomics.00058.2012. Although the specific functions of sleep have not been completely elucidated, the literature has suggested that sleep is essential for proper homeostasis. Sleep loss is associated with changes in behavioral, neurochemical, cellular, and metabolic function as well as impaired immune response. Using high-resolution microarrays we evaluated the gene expression profiles of healthy male volunteers who underwent 60 h of prolonged wakefulness (PW) followed by 12 h of sleep recovery (SR). Peripheral whole blood was collected at 8 a.m. before the initiation of PW (Baseline), after the second night of PW, and one night after SR. We identified over 500 genes that were differentially expressed. Notably, these genes were related to DNA damage and repair and stress response, as well as diverse immune system responses, such as natural killer pathways including the killer cell lectin-like receptor family, as well as granzymes and T-cell receptors, which play important roles in host defense. These results support the idea that sleep loss can lead to alterations in molecular processes that result in perturbation of cellular immunity, induction of inflammatory responses, and homeostatic imbalance. Moreover, expression of multiple genes was downregulated following PW and upregulated after SR compared with PW, suggesting an attempt of the body to re-establish internal homeostasis. In silico validation of alterations in the expression of CETN3, DNAJC, and CEACAM genes confirmed previous findings related to the molecular effects of sleep deprivation.
Thus, the present findings confirm that the effects of sleep loss are not restricted to the brain and can occur intensely in peripheral tissues.

Comparative Genome Analysis of Trichophyton rubrum and Related Dermatophytes Reveals Candidate Genes Involved in Infection

Relevance: 20.00%

Abstract:

The major cause of athlete's foot is Trichophyton rubrum, a dermatophyte or fungal pathogen of human skin. To facilitate molecular analyses of the dermatophytes, we sequenced T. rubrum and four related species, Trichophyton tonsurans, Trichophyton equinum, Microsporum canis, and Microsporum gypseum. These species differ in host range, mating, and disease progression. The dermatophyte genomes are highly colinear yet contain gene family expansions not found in other human-associated fungi. Dermatophyte genomes are enriched for gene families containing the LysM domain, which binds chitin and potentially related carbohydrates. These LysM domains differ in sequence from those in other species in regions of the peptide that could affect substrate binding. The dermatophytes also encode novel sets of fungus-specific kinases with unknown specificity, including nonfunctional pseudokinases, which may inhibit phosphorylation by competing for kinase sites within substrates, acting as allosteric effectors, or acting as scaffolds for signaling. The dermatophytes are also enriched for a large number of enzymes that synthesize secondary metabolites, including dermatophyte-specific genes that could synthesize novel compounds. Finally, dermatophytes are enriched in several classes of proteases that are necessary for fungal growth and nutrient acquisition on keratinized tissues. Despite differences in mating ability, genes involved in mating and meiosis are conserved across species, suggesting the possibility of cryptic mating in species where it has not been previously detected. These genome analyses identify gene families that are important to our understanding of how dermatophytes cause chronic infections, how they interact with epithelial cells, and how they respond to the host immune response. IMPORTANCE Athlete's foot, jock itch, ringworm, and nail infections are common fungal infections, all caused by fungi known as dermatophytes (fungi that infect skin). 
This report presents the genome sequences of Trichophyton rubrum, the most frequent cause of athlete's foot, as well as four other common dermatophytes. Dermatophyte genomes are enriched for four gene classes that may contribute to the ability of these fungi to cause disease. These include (i) proteases secreted to degrade skin; (ii) kinases, including pseudokinases, that are involved in signaling necessary for adapting to skin; (iii) secondary metabolites, compounds that act as toxins or signals in the interactions between fungus and host; and (iv) a class of proteins (LysM) that appear to bind and mask cell wall components and carbohydrates, thus avoiding the host's immune response to the fungi. These genome sequences provide a strong foundation for future work in understanding how dermatophytes cause disease.

Non-classical HLA-E gene variability in Brazilians: a nearly invariable locus surrounded by the most variable genes in the human genome

Relevance: 20.00%

Abstract:

The non-classical human leukocyte antigen (HLA) class I genes present a very low rate of variation. So far, only 10 HLA-E alleles encoding three proteins have been described, but only two are frequently found in worldwide populations. Because of their historical background, Brazilians are very suitable for population genetic studies. Therefore, 104 bone marrow donors from Brazil were evaluated for HLA-E exons 1-4. Seven variation sites were found, including two known single nucleotide polymorphisms (SNPs) at positions +424 and +756 and five new SNPs at positions +170 (intron 1), +1294 (intron 3), +1625, +1645 and +1857 (exon 4). Haplotyping analysis revealed eight haplotypes, three of them known as E*01:01:01, E*01:03:01 and E*01:03:02:01, and five new HLA-E alleles that carry the new variation sites. The HLA-E*01:01:01 allele was the predominant haplotype (62.50%), followed by E*01:03:02:01 (24.52%). Selective neutrality tests disclosed an interesting pattern of selective pressures in which balancing selection is probably shaping allele frequency distributions at an SNP at exon 3 (codon 107), whereas sequence diversity at exon 4 and the non-coding regions is under significant purifying pressure. Even in an admixed population such as the Brazilian one, the HLA-E locus is very conserved, presenting few polymorphic SNPs in the coding region.

Genome-Wide Analysis in Brazilian Xavante Indians Reveals Low Degree of Admixture

Relevance: 20.00%

Abstract:

Characterization of population genetic variation and structure can be used as a tool for research in human genetics, and population isolates are of great interest. The aim of the present study was to characterize the genetic structure of Xavante Indians and compare it with other populations. The Xavante, an indigenous population living in the Brazilian Central Plateau, is one of the largest native groups in Brazil. A subset of 53 unrelated subjects was selected from the initial sample of 300 Xavante Indians. Using 86,197 markers, the Xavante were compared with all populations of the HapMap Phase III and HGDP-CEPH projects and with a Southeast Brazilian population sample to establish their population structure. Principal Components Analysis showed that the Xavante Indians are concentrated in the Amerindian axis near other populations of known Amerindian ancestry such as Karitiana, Pima, Surui and Maya, and a low degree of genetic admixture was observed. This is consistent with the historical records of population bottlenecks and cultural isolation. By calculating pairwise F_ST statistics we characterized the genetic differentiation between Xavante Indians and representative populations of the HapMap and HGDP-CEPH projects. We found that the genetic differentiation between Xavante Indians and populations of Amerindian, Asian, European, and African ancestry increased progressively. Our results indicate that the Xavante are a population that has remained genetically isolated over the past decades and can offer advantages for genome-wide mapping studies of inherited disorders.
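
As a rough illustration of what a pairwise F_ST measures, the sketch below computes Wright's F_ST for a single biallelic marker from the allele frequencies in two populations, using F_ST = (H_T - H_S) / H_T. The abstract does not say which estimator the study used, so this is an assumption made for illustration: it weights both populations equally and applies no sample-size correction. The function name `fst_biallelic` is hypothetical.

```python
def fst_biallelic(p1: float, p2: float) -> float:
    """Wright's F_ST for one biallelic locus and two equally weighted populations.

    p1, p2: frequency of the same allele in population 1 and population 2.
    Returns 0.0 when the locus is monomorphic overall (H_T == 0).
    """
    for p in (p1, p2):
        if not 0.0 <= p <= 1.0:
            raise ValueError("allele frequencies must lie in [0, 1]")

    p_bar = (p1 + p2) / 2.0
    # Expected heterozygosity of the pooled population.
    h_t = 2.0 * p_bar * (1.0 - p_bar)
    if h_t == 0.0:
        return 0.0
    # Mean expected heterozygosity within each population.
    h_s = (2.0 * p1 * (1.0 - p1) + 2.0 * p2 * (1.0 - p2)) / 2.0
    return (h_t - h_s) / h_t


if __name__ == "__main__":
    print(fst_biallelic(0.5, 0.5))  # identical frequencies: no differentiation
    print(fst_biallelic(0.0, 1.0))  # fixed for different alleles: maximal
```

A genome-wide value such as the one reported over 86,197 markers would combine many loci, and how the per-locus values are combined depends on the estimator, which the abstract does not specify.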

Is a Genome a Codeword of an Error-Correcting Code?

Relevance: 20.00%

Abstract:

Since a genome is a discrete sequence, the elements of which belong to a set of four letters, the question as to whether or not there is an error-correcting code underlying DNA sequences is unavoidable. The most common approach to answering this question is to propose a methodology to verify the existence of such a code. However, none of the methodologies proposed so far, although quite clever, has achieved that goal. In a recent work, we showed that DNA sequences can be identified as codewords in a class of cyclic error-correcting codes known as Hamming codes. In this paper, we show that a complete intron-exon gene, and even a plasmid genome, can be identified as a Hamming code codeword as well. Although this does not constitute a definitive proof that there is an error-correcting code underlying DNA sequences, it is the first evidence in this direction.
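
To make concrete what it means for a sequence to be a codeword of a Hamming code, the sketch below uses the binary Hamming(7,4) code: a word is a codeword exactly when its syndrome under the parity-check matrix is zero. This is an assumption chosen for illustration only. The abstract does not state the alphabet, code length, or nucleotide-to-symbol mapping used in the paper, so nothing here reproduces the authors' construction. The names `H`, `syndrome`, `is_codeword` and `correct_single_error` are hypothetical.

```python
# Parity-check matrix of the binary Hamming(7,4) code. Column j (1-based)
# is the binary representation of j, least significant bit in the first row.
H = [
    [1, 0, 1, 0, 1, 0, 1],
    [0, 1, 1, 0, 0, 1, 1],
    [0, 0, 0, 1, 1, 1, 1],
]


def syndrome(word):
    """Return H * word^T over GF(2) as a tuple of three bits."""
    if len(word) != 7:
        raise ValueError("expected a 7-bit word")
    return tuple(sum(h * w for h, w in zip(row, word)) % 2 for row in H)


def is_codeword(word):
    """A word belongs to the code exactly when its syndrome is all zeros."""
    return syndrome(word) == (0, 0, 0)


def correct_single_error(word):
    """Flip the single bit indicated by the syndrome, if any."""
    s = syndrome(word)
    position = s[0] + 2 * s[1] + 4 * s[2]  # 1-based error position, 0 if none
    fixed = list(word)
    if position:
        fixed[position - 1] ^= 1
    return fixed


if __name__ == "__main__":
    received = [1, 1, 1, 0, 1, 0, 0]  # a codeword with its fifth bit flipped
    print(is_codeword(received))
    print(correct_single_error(received))
```

Testing a DNA sequence in this spirit would additionally require fixing a mapping from nucleotides to code symbols, and that choice is part of the methodology the abstract refers to rather than something this sketch can supply.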
